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Abstract - We derive a general quantum fluctuation theorem that explicitly accounts for the back 
action of quantum measurements on reduced systems. This theorem encompasses most thermody- 
namically relevant fluctuation theorems based on two-time measurements. As specific examples, 
we analyze the quantum Jarzynski equality for non-unitary dynamics and derive a fluctuation 
theorem for systems undergoing thermal changes. In addition, we show how a sharpened Holevo's 
bound follows as a consequence of our new general theorem. 
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Thermodynamics and information theory are intimately 
connected. The most prominent evidence for this relation- 
ship is that the Clausius entropy pQ is given by the Shan- 
non information [2J in systems at thermal equilibrium [3] . 
In particular, Landauer's principle [US] illustrates that in- 
formation is a physical, measurable quantity. Thermody- 
namic work has to be performed in order to create or erase 
information. Landauer's principle can therefore be under- 
stood as a statement of the second law of thermodynamics 
in an information theoretic context. This is also true for 
Holevo's bound [5J, which limits the amount of classical 
information that can be encoded in a generic quantum 
system. Recently, nano-devices - for which these princi- 
ples are directly applicable - have become experimentally 
accessible [7] . These controlled quantum systems have ap- 
plications ranging from quantum simulation |8l9j , cryptog- 
raphy [TU], computing [PTrTLl] . to metrology [HUT?] . The 
main obstacles towards realization of such devices are con- 
trol noise and interactions with the environment, so a ther- 
modynamic study is necessary to fully understand their 
information theoretic properties. However, most nano- 
devices operate far from thermal equilibrium, so tools from 
non-equilibrium statistical physics are required. In recent 
years, formulations of the second law have been derived 
which are valid arbitrarily far from equilibrium. These 
so-called fluctuation theorems, in particular the Jarzynski 
equality |18j , enable the calculation of equilibrium quanti- 
ties from non-equilibrium averages over many realizations 
of a single process. They also encompass non-equilibrium, 



information theoretic generalizations of the second law. 
As Landauer's principle is a direct implication of these 
theorems [19] ■ one may ask whether Holevo's bound is 
also such a result. One complication in this case is that 
the approach to fluctuation theorems for quantum sys- 
tems is mathematically and conceptually more involved. 
Thermodynamic quantities, which are not given as state 
functions, cannot be assigned an Hermitian operator [20] . 
The proper formulation of quantum thermodynamics for 
non-equilibrium systems, especially quantum fluctuation 
theorems |3T], must therefore be treated with care. 

The purpose of the present note is twofold. In the first 
part we derive a general quantum fluctuation theorem that 
accounts for the back action of measurements on reduced 
systems (see [53] for a similar approach). To this end, 
we consider an experimental point of view; we assume 
that the system of interest is coupled to an environment 
which is experimentally inaccessible. Such measurements 
on open quantum systems are inherently incomplete, since 
they ignore environmental degrees of freedom. Informa- 
tion is lost that in principle could have been acquired by 
concurrent measurement of the reservoir. A general for- 
mulation of quantum fluctuation theorems must explicitly 
account for these effects. The integral fluctuation theo- 
rem we derive is applicable to arbitrary orthogonal mea- 
surements, for systems undergoing both unitary and non- 
unitary dynamics. We show that this result encompasses 
several integral quantum fluctuation theorems, and in par- 
ticular derive the quantum generalization of the fluctua- 
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tion theorem for thermal work [2"5] . 

In the second part we focus on an information theoretic 
consequence of the fluctuation theorem, a sharpened state- 
ment of Holevo's bound. Our derivation thereby illustrates 
a notable connection between quantum thermodynamics 
and quantum information theory. 

General quantum fluctuation theorem. Con- 
sider a time-dependent quantum system, S, with Hilbert 
space Hs and initial density matrix pt=o = po- Infor- 
mation about the state of the system is obtained by per- 
forming measurements on S at the beginning and end of 
a specific process. At t = a quantum measurement is 
made of observable A 1 , with eigenvalues a l m . Letting LT^ 
denote the orthogonal projectors into the eigenspaces of 
A 1 , we have A 1 = Y^m a m^m- Note that the eigenvalues 
{a^} can be degenerate, so the projectors {11^} may have 
rank greater than one. Unlike the classical case, as long 
as po and A 1 do not have a common set of eigenvectors - 
i.e. they do not commute - performing a measurement on 
S alters its statistics. Measuring a\ n maps po to the state 
lEnPoIEm/pm, where p rn = tr {n^p IIJ n } is the probabil- 
ity of the measurement outcome a l m . Generally accounting 
for all possible measurement outcomes, the statistics of S 
after the measurement are given by the weighted average 
of all projections, 

M i (p ) = J2^mP0lL- (1) 

m 

If po commutes with A 1 , it commutes with each LT^, so 
M 1 (po) = Y^m^-nflmP = Po an d the statistics of the sys- 
tem are unaltered by performing the measurement. Af- 
ter measuring a l rn , S undergoes a generic time evolution 
over time t, after which it is given by ¥* T (W m poW rn ) / p m . 
Here E T represent any linear (unitary or non-unitary) 
quantum transformation, which is trace-preserving and 
maps non-negative operators to non-negative operators. 
Further, we require that this holds whenever E T is ex- 
tended to an operation E T ® Ig (lg being the identity map 
on Hg) on any enlarged Hilbert space Hs <8> Hs- Such 
a transformation is called a trace-preserving, completely 
positive (TCP) map [24]. At time t = r a measure- 
ment of a second (not necessarily the same) observable, 
A 1 = J2 n a n^ni ^ s performed on S. The probability of 
measuring a ! n , conditioned on having first measured a^, 
is p n \m = tr{n^E T (U^poU^)} /p m . Accordingly, the 
joint probability distribution p rn ~>n reads 

Pm-Hi = Pm ■ Pn\m = ^ {U f n E T (ll^poU^) } . (2) 

We are interested in the probability distribution of possi- 
ble measurement outcomes, V (Aa) = (S (Aa — Aa„. m )}, 
where Aa„ i?Ti = a l n — a l m is a random variable determined 
in a single measurement run. Its probability distribution 
is given by averaging over all possible realizations, 

V (Aa) = ^ 6 (Aa - Aa„, m ) p m ^n ■ (3) 



To derive the integral fluctuation theorem we follow the 
standard approach and compute its characteristic func- 
tion, Q(s), which is the Fourier transform of V(Aa) [21] 

Q(s) = I d(Aa)P(Aa) exp(isAa) 

J (4) 

= tr {cxp (isA { ) E r (M\p Q ) exp (-isA))} . 

Choosing s = i, we obtain the general quantum fluctuation 
theorem 

(exp(-Aa)) = 7 . (5) 

The quantity 7 accounts for the information lost by not 
measuring the environment. It plays a crucial role in the 
following discussion and is given explicitly by 

7 = tr {cxp (-A 1 ) E r (M>o) exp (A'))} . (6) 

Similar fluctuation theorems of the form (exp (— E)) = y c \ 
have been derived in the context of classical feedback pro- 
cesses, where E is an entropy production. For these the 
right hand side, 7 c i, is commonly called the (classical) effi- 
cacy of the feedback protocol [3S] . In the present context 7 
can then be interpreted as a quantum efficacy correspond- 
ing to the observables A 1 and A f , and the TCP map E T . 
Note that 7 only takes the from of the classical efficacy, 
7d, and usually may not be regarded as its physical quan- 
tum analogue |26j . For an optimal choice of initial and 
final measurement, 7 = 1. These special measurements 
can be interpreted as optimally designed observables cor- 
responding to the TCP map E T , such that the fluctuation 
theorem reduces to the classical form without any quan- 
tum corrections. 

A complementary result relevant to the fluctuation the- 
orem is Jensen's Inequality, which states that for any con- 
vex function <p and random variable x, (4>(x)) > <f>((x)) 
[27] , Applying this to Eq. (J5J yields 

(Ao)>-Iog( 7 ). (7) 

For specific choices of thcrmodynamically relevant observ- 
ables A 1 and A , this relation can be understood as a gen- 
eralized formulation of the Clausius inequality. 

Quantum corrections to classical theorems. 

As an illustrative example, let us turn to the generaliza- 
tion of the quantum Jarzynski equality to generically open 
quantum systems, 

(exp(-/3WO)=exp(-/3AF), (8) 

where W is the work done on the system and AF the 
change in initial and final equilibrium free energy. We 
consider a bipartite quantum system that can be separated 
into a time-dependent system of interest, S, and a thermal 
environment, £>, with inverse temperature f3. The total 
Hamiltonian for S and B is 

H tot (t) = H s (t) ® I B + I s ® H B + h S B , (9) 
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where h$B denotes an interaction between S and B. Fur- 
ther, we assume that Hs(t < 0) = Hs(0), so at t = 
the total system will have relaxed to its thermodynamic 
equilibrium state, po = exp(—/3H tot (0))/Zo, where Z = 
tr {exp (— (3H tot (0))} is the partition function for the com- 
posite system. It is worth emphasizing that although the 
total system is in a Gibbs state, the reduced state is gen- 
erally not, p s (0) = tr B {p } ^ exp (-/3H S (0))/Z S (0) [28]. 

Unlike the classical case |29j . if only partial informa- 
tion (through measurement of S) is available, Eq. © is 
not strictly valid, though it can be corrected by mak- 
ing a concurrent measurement of the environment |30j . 
The present treatment proposes a generically valid scheme 
to determine quantum corrections to the classical result, 
which for open systems have only been properly treated 
in specific cases Usually it is assumed that the sys- 
tem Hamiltonian is parametrized by an external control 
parameter at, so the work performed on the system is 
W — tr j f£ dt "tf)§f Ptj- However, work is not a quan- 
tum observable, and is thus usually defined through an 
initial and final energy measurement 20,32,33 . We there- 
fore consider Aa n _ m = - £^J, where E^E^ are 
the measured energy eigenvalues of Hs{t) and Hg(0), re- 
spectively. Accordingly, we have A 1 = (3Hs (0) ® I B and 
A 1 = /3Hs{t) ®I B . Note that for generically open systems 
such measurements only determine the change of inter- 
nal energy, AE. A distinction of work, W , and heat, Q, 
only becomes feasible if additional information about the 
microscopic dynamics is accessible [3]. We therefore for- 
mulate the quantum analogue of Eq. flSJ) in terms of AE. 

The quantum efficacy 7 ([6]) for this protocol reads 

7 = exp(-/3AF) 

x tv{[pf(r) ® I B ] E T (M\ Po ) [pfiO)- 1 ®I B ])} , 

(10) 

where p e £(t) = exp(-0H s (t))/Z s (t) is the Gibbsian of 
the isolated system, and (iAF = — log(Zs(r)/Zs(0)) its 
associated change in equilibrium free energy. The TCP 
map E r is given by the unitary evolution of the total sys- 
tem, E T (-) = U T ,o ■ Ul , with U Tfi = T> J^dtH tot {t). 
Thus, as a consequence of Eq. ([5]) we have 

<exp(-/3(A£-AF))) = 

tr{[p^(r)®I s ] U Ti0 M\p ) [^(O)" 1 ®!^ t/| } . 

(11) 

where the quantity f3(AE — AF) is recognized as the 
change in the thermodynamic entropy of the system. In 
the thermally isolated limit, Q = 0, this expression re- 
duces to j8 (W - AF) found in Eq. flSJ). 

To analyze when Eq. (jlip reduces to its classical form, 
we write M^o) [^(O)" 1 ® l B ] = (Is + Xs) ® pb(0), 
where pb(0) = trs {po} and xs is a correction term gov- 
erned by hse [IB]- Then Eq. (jTTJ) can be written as 



where e is the quantum correction term to the classical 
form of the Jarzynski equality. It reads explicitly 



= tr { [p^(r) ® I B ] U r , [[Is ® Pb(0)] , E^ j0 ] } 
+ tr ® I B ] C/ r . [ X s ® p B (0)] < } ■ 



(13) 



When the interaction term h$B is sufficiently small, 
ps{0) = tr B {p } ^ Ps q (0) EE], and we have xs - 0. 
Likewise in this limit pb(0) oc exp (— (3Hb), which com- 
mutes with U+ ~ (fZf ® exp (— iIT B T)) . Hence both 
terms contributing to e vanish whenever the contribution 
of the interaction term can be neglected. This is the case 
for ultra-weak coupling or in the high temperature limit 
(see also |31j for an analysis in the context of Quantum 
Brownian Motion) . 

For situations in which only a subsystem of the total 
quantum system is accessible, Eq. (|13p yields an explicit 
formula for the quantum corrections to the classical form 
of the Jarzynski equality. To calculate these corrections 
perturbatively in h$B, one may use Suzuki's relation j34j 



d x V(t) = V(t) / ds V(s)- 1 d x A(s) V(s), (14) 



where V(t) is an operator satisfying V(0) = I, d t V = 
A(t)V(t) and A is an expansion parameter. This is done 
by applying Eq. (TT4"]) to cxp(— /3(iJ to t(0))) and U T) o, with 
A corresponding to the operator norm ||/is B ||. 

Fluctuation theorem for thermal changes. By 

appropriate choice of initial and final observables A 1 and 
A 1 , Eq. ([5]) reproduces several known quantum fluctua- 
tion theorems [2"0H2"2"i r32, 35 38]. As a new contribution, 
we consider a quantum system consisting of a system of 
interest and an environment with time-dependent temper- 
ature. For classical, deterministic systems this situation 
was considered in [23l[39]. To derive the quantum gener- 
alization of the fluctuation theorem proposed in |23j . the 
corresponding observables A 1 and A 1 need to be identi- 
fied. In the classical treatment one considers a generalized 
entropy, AX, given by 



AA(r ,r T ) = log (p^ro)) - log (p f (r T )) 



(15) 



(exp (—j3(AE — AF))) = 1 + e, 



(12) 



P'(r) represents the initial distribution of the system 
of interest, defined over phase space parameter T, while 
P f (r) is an arbitrarily chosen distribution, cf. [JO]. For 
a given phase space trajectory T t , AX is only deter- 
mined by its initial and final values, which directly en- 
ables the definition of its quantum analogue. Accordingly, 
we choose the pair of observables A 1 = — log(ps(0)) and 
A s = — log(cr), where ps{0) = trg{/? } is the initial dis- 
tribution of the reduced system, and tr is a final density 
matrix analogous to P l (T). The dynamics of the compos- 
ite system are unitary, described by the time-dependent 
Hamiltonian H to t(t)- It is instructive to assume that 
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Htot{t) = Hs + Hg(t) + hse- Moreover, we assume that 
the total system is initially prepared in a product state, 
p(0) = As(0) <g) jOg(O). This assumption is not restrictive 
as we initially perform a measurement on 5, thereby dis- 
entangling S from B. The TCP map describing the time 
evolution of S is given by tracing out the environment, 

E t (as) = tr B |{/ T ,o [fis ® Pb(0)] ^,o}' where ?7 T ,o is the 
time evolution operator associated with H tot (t). Equa- 
tion ([3]) takes the form 

(exp (—AX)) = 7 and 7 = tr |crE r (P )} , (16) 

where Po is the projector into the support of ps(0). We 
proceed by identifying h$B as an interaction between iS 
and only a subset of the degrees of freedom in the bath, 
B. That is, we assume that the bath Hilbert space can 
be decomposed into two components, T~L' B ® Hu, where 
Ti' B describes the degrees of freedom in direct contact 
with the system, while Hu is the rest of the universe. 
This separation becomes mathematically necessary in or- 
der to properly describe the physical setup. While %' B 
is a thermal bath, assumed to always be in a Gibbsian 
Pb> oc exp (ptHe' ) , there is an external agent living in 
Hit and varying the temperature of B' . If the interaction 
term hse is small, we have ps(0) ~ exp (— /3qHs)/Zs{0), 
with Zs(Q) = tr {exp (— /3oHs)}- Over time r, the in- 
verse temperature of the bath, j3t , is varied from /3o to /3 T 
according to some external protocol. Since Hs is time- 
independent, changes in the system's energy arc identified 
with thermal work |23U39j . The quantum fluctuation the- 
orem is obtained by choosing A 1 = — log(cr) , where a = 
exp(—(3 T H s )/Z s (T) and Z T = tr {exp (— (3 T Hs)}. Accord- 
ingly, the thermal work is defined as AX = (3 t (E t — P*)lo 
and we have 

(exp(-AX)) = tr{exp(-/3 T H s )/Z s (r)E T (I s )} . (17) 

As in the case of the Jarzynski equality, the non-unitary 
dynamics of the reduced subsystem lead to quantum cor- 
rections to the classical form. In Eq. (jTT)) they are fully de- 
termined by the TCP map E r acting on the identity in the 
reduced system, S. The classical limit is recovered with 
the help of the Born approximation, pit) ~ ps(t) ® ps(t). 
Within this limit it turns out that the solution of the 
master equation ip{t) = [H(t),p(t)} satisfies ips{t) — 
[H s + h cS (t),p s (t)], with h cS (t) = tr B {h SB [Is ® p B (t)]}. 
Since h eS (t) is Hermitian [41] , ps{t) is the solution of 
a master equation describing unitary dynamics, so that 
E T (I 5 ) = Is. The right hand side of Eq. (JTTJ) reduces to 
unity in the Born approximation, and therefore the clas- 
sical form is recovered in the weak coupling limit. 

Unital maps and the quantum efficacy. Let us 

consider general conditions for which 7 = 1 in Eq. (|16l) . 
For the sake of simplicity, assume ps(0) is full rank, so that 
Po = Is- In this case we have 7 = 1 for all TCP maps sat- 
isfying E T (I,s) = I5. Such maps are called unital quantum 



channels. The simplest unital channel is unitary evolution, 
E T (-) = U ■ U\ where U is a unitary operator. In fact, the 
unital channels are the set of affine combination of unitary 
channels, {J2 k r k U k ■ U\ : £> fe = l,7T fe G ffi} 02]. In the 
case of Hamiltonian evolution of a composite system, if 
[H(t),Ig <8> Pe(0)] = for all t then E T is unital. Such an 
evolution implies an interaction with no heat exchange, 
even though S and B may become correlated. Finally, 
we note that the choice of ps (0) and a in the definition of 
AX is incidental, in that (exp (~AX)} = 1 holds irrespec- 
tive of their choice. This is similar to Seifert's fluctuation 
theorem for stochastic systems [301113] • 

Sharpened Holevo's theorem. — The rest of the 
present note provides an example of how tools from quan- 
tum thermodynamics may benefit quantum information 
theory. Specifically, we use the general quantum fluctua- 
tion theorem ([S]) to derive a sharpened version of Holevo's 
bound. This bound sets a limit on how much classical 
information can be sent through a (noisy) quantum chan- 
nel. Let us consider a message composed of code words 
Wj that appear with probability nj. A messenger (Alice) 
attempts to transfer this message to a receiver (Bob) by 
encoding each word Wj in a quantum state and transmit- 
ting that state to Bob. We assume that Bob receives the 
state pj, which may have come through a lossy medium 
and therefore be different from the original state prepared 
by Alice. Bob attempts to infer the word Wj from the en- 
coding by making a generalized measurement of the state 
Pj. This corresponds to introducing a probe, initially in a 
pure state |0), and making an orthogonal measurement on 
the compound state pj <g> |0)(0| [U]- If {life} represents the 
set of orthogonal projectors corresponding to Bob's mea- 
surement, the probability of measuring Ilfc, given message 
wj, is given by 

n k]j = tr{ Pj ® |0><0|n fc > = tr{ Pj M k } , (18) 

where M k = (0|IIfc|0) are operators acting only on the 
encoding degree of freedom. Although the M k are non- 
negative and J^k = I, generally they are not projec- 
tors, ^ M k - Such a collection {M k } is called a posi- 
tive operator valued measure (POVM), and describes the 
most general measurement on a quantum system. The 
classical message distribution {^j}, output quantum en- 
coding {pj}, and POVM elements {M k } define a classical- 
quantum channel |45j . 

A proper measure of how well Bob decodes Alice's mes- 
sage is the mutual information between the message and 
measurement distributions, I = J2jk n i n k\j 1°§ ( 7r fc|.j/ 7r fc), 
where Tr k = y~]j nj n k \j is the overall probability of mea- 
suring Ilfc. Note that I = - TTjD(iT k \j \ \~K k ), where 

£>(7Tfc|jlKfc) = Sfe^fclj l°g (7rfc|j/vrfe) is the (classical) rela- 
tive entropy |46j . Hence / is a sum of non- negative terms 
and is if and only if n k \j = ix k for all k,j. That is, / 
vanishes only if all outcomes of the measurement are in- 
dependent of the encoded word, so that Bob always learns 
nothing about the message. 
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The probability of the message being Wj, conditioned 
on Bob measuring ITfc is, Hj\ k = xuj 7Tj / 7r k . We have 



1 = S({TTj}) + ^TTfc ^ %j\ k \0g', 



j\k: 



(19) 



where S({nj}) = — J2j 71 j ^°S n j is the Shannon informa- 
tion of distribution {nj}. Since xlog(x) < for \x\ < 1, 
with equality only for x — and x = 1, we observe that 
I < S({7Tj}), with equality if and only if -Kj\ k is or 1 
for all j, k. In other words, the mutual information / is 
thus at most S({iTj}), with equality if and only if Bob 
correctly decodes the message in every instance. If Alice's 
encoded states are not perfectly distinguishable (that is, if 
supp(pj) and supply) are not orthogonal spaces for some 
j 7^ then I can never equal S({iTj}), no matter what 
measurement Bob chooses to make. Holevo's theorem is 
then an upper bound for /, namely 



X 



S(p)-Y,*rS(pi)>i> 



(20) 



where S(p) = — trjplogp} is the von- Neumann entropy, 
and p = ^2jXjPj is the density matrix describing the 
statistics of the encoding given no knowledge of the mes- 
sage word. Heuristically, x can be considered as the uncer- 
tainty of the encoding with no knowledge of the message, 
minus the average remaining uncertainty given knowledge 
of the message. Note that if the encoded states are distin- 
guishable, i.e. pjPji = O for all j ^ j' , then x — 
so that with a proper measurement Bob may always cor- 
rectly decode Alice's message. 

We now show how Holevo's theorem (|2"0|) follows as a 
consequence of the general quantum fluctuation theorem 
([5]) . To do this we must appropriately choose initial state, 
evolution operation, and observables so that the random 
variable (Aa) averages to x~ I- In the language of the gen- 
eral fluctuation theorem, let the initial state po reside in a 
composite Hilbert space He ® Hv <8> Hm ■ He represents 
the encoding Hilbert space, which Alice prepares and Bob 
then measures, Hp is the probe Hilbert space accessible 
only to Bob, and Hm is a message Hilbert space. Note 
that Hm is n °t a real, physically accessible subspace, but 
rather a mathematical construction denoting the memory 
for the classical information of the message [13] . We have 



Po=^vr ift ® |0)(0|® 



(21) 



where the states \j), each corresponding to word Wj, form 
an orthonormal basis for Hm- We see that, with prob- 
ability TTj , po corresponds to the message state \j). The 
measured initial and final observables are 



.4' 



£ log (ft)® |0><0|® 



A { = - log Q5® \0)(0\)®I M 

- lo § ( 7r fc|i/ 7r fc) Uk b">0'l > 

k,j 



(22) 



where If~j = log (*'k\j/' 7r k) . Note that we are not observ- 
ing the original encoding set up by Alice, but rather its 
time evolved state after undergoing possibly noisy dynam- 
ics in a quantum channel. To apply the fluctuation the- 
orem ([SJ, we start with the output of the noisy channel 
and perform the two measurements, A 1 and A 1 , instanta- 
neously after each other. The TCP map crucial for Eq. ([SJ 
is thus the identity map E(p) = p. A 1 commutes with 
po, so p T = E T (Af 1 ( / 9o)) = po and the measurement of 
A is carried out on the same state. After computing 
(Aa) = tr {(A { - A l )po} = x - I, Eq. Q becomes 



X - I > - log (7) . 



(23) 



where the corresponding quantum efficacy is given by 

7 = tr {cxp {-A f )po cxp (A 1 ) } . (24) 



Equations (|23|) and (|24|) constitute the sharpened Holevo's 
bound as a consequence of the general quantum fluctua- 
tion theorem ([S]). Indeed, our new bound is tighter than 
the usual inequality (f2"0)) . in the sense that the correction 
term, — log (7), is always non- negative. Consider 

Po exp [A 1 ) = J2 KiPi cxp (- log( ft )) ® |0)(0| ® 
3 

= X)7r i ^®|0)(0|®|i)(j|, 

(25) 

where Pj is the projector into the support of Pj. We can 
rewrite Eq. flU} with Eq. ([2SJ as 

7 = tr {exp (- A 1 ) po cxp (A 1 ) } 
= tri exp (-A f )J2^3Pj ® |0> <0| ® \ 



< 



3 y 



exp log(p<8>|0)<0|) 



X>,n t )}. 



(26) 



The inequality in Eq. (|26[) is justified by noting that 
exp (log (p (g> |0)(0|) + J2k I k >3 ' n*;) is non-negative and 
Pj <g> |0)(0| is a projection operator. We now use a state- 
ment of the Golden- Thompson inequality [TT. 15 : for any 
Hermitian operators A and B, we have tr {exp (A + B)} < 
tr {exp (A) exp (B)}. Note that in the present case, A and 
B arc both logarithms of bounded Hermitian operators, 
and are only bounded from above, though the Golden- 
Thompson inequality still holds . Accordingly, we have 

7 = tr {cxp (-A f ) po cxp (A 1 ) } 

< *3 tr \ exp (log (p ® |0)(0|)) ■ cxp ( £ 7 feJ n fc ) 1 



^2*3 ix\ (p® |0)(0|)5> fc|j /7r fc n* 



(27) 
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From the definition irk = 7r j 7r fc|j we finally obtain 

7<tr|p®|0)(0|^n fe | = 1, (28) 

which shows that —log (7) > 0, as desired. It is worth 
mentioning that our derivation does not invoke the strong 
subadditivity of von Neumann entropy, and thus relies on 
weaker statements than the ones commonly required to 
prove Holevo's bound [50] . 

Concluding remarks. — We developed a general 
framework for quantum fluctuation theorems by explicitly 
accounting for the back action of quantum measurements. 
This enabled us to analyze quantum corrections to clas- 
sical theorems and derive a new fluctuation theorem for 
thermal quantum work. Finally, we showed that quantum 
mechanical formulations of the second law are intimately 
tied to quantum information theory by deriving Holevo's 
bound as a consequence of the general quantum fluctua- 
tion theorem. 
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